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(57) Digital watermarking of data, including image, 
video and audio data, is performed by repeatedly insert- 
ing the watermark into subregions or subimages of the 
data. Similarly, the watermark is repeatedly extracted 
from the subregions of the data. 



fATERHARK 
SIGNAL 



r20 



ERROR 

CORRECTION 
ENCODER 



c2 I 



PN MAPPER 



-22 



SPECTRAL 
TRANSFORUEIi 



rZ.3 



tATERUARK 

SEGUENTER 



WATERHARK 
INSERTER 



FIG 2 



Primed by Xerw (UK) Business Services 
2.16.3C4 



DATA TO BE" 
tATERHARICED 



DATA 

SEGMENTER 



^2 5 a 



TT 



:2 5 b 



fATERUARK 
INSERTED 



2 5 



VATERMARR 
INSERTER 



DATA 

COUBINER 



'2 6 



WATERUARXED 
DATA 



3NS0OCID; <EP 



1 



EP0840 513A2 



2 



Description 

The present invention relates to digital waternnark- 
ing of data including image, video and multimedia data. 
Specifically,, the invention relates to the insertion and 
extraction of embedded signals for purpose of vvater- 
marking, in which the insertion and extraction proce- 
dures are repeatedly applied to subregions of the data. 
When these subregions correspond to the 8x8 pixel 
blocks used for MPEG and JPEG compression and 
deconrpression, the watermarking procedure can be 
tightly coupled with these compression algorithrhs to 
achieve very significant savings in computation. 

The proliferation of digitized media such as image, 
video and multimedia Is creating a need for a security 
system which facilitates the identification of the source 
of the material. 

Content providers, i.e. owners of works in digital 
data form, have a need to embed signals into 
video/image/multimedia data which can subsequently 
be detected by software and or hardware devices for 
purposes of authenticating copyright ownership, control 
and management. 

For example, a coded signal might be inserted in 
data to indicate that the data should not be copied. The 
emt)edded signal should preserve the image fidelity, be 
robust to common signal transformations and resistant 
to tampering. In addition, consideration must be given to 
the data rate that can be provided by the system, 
though current requirements are relatively bw - a few 
bits per frame. 

in EP-A-766 468 con-esponding to U.S. latent 
Application Serial No. 08/534,894, filed September 28, 
1995, entitled "Secure Spread Spectrum Watermarking 
for Multinrtedia Data" and assigned to the same 
assignee as tie present invention, which is incorporated 
herein by reference, there was proposed a spread spec- 
trum watermarking method which embedded a water- 
mark signal into perceptually significant regions of an 
image for the purposes of identifying the content owner 
and/or possessor. A strength of this approach is that the 
watermark is very difficult to remove. In fact, this method 
only allows the watermark to be read if the original 
image or data is available for comparison. This is 
because the original spectrum of the watermark is 
shaped to that of the image through a non-linear multi- 
plicative procedure and this spectral shaping must be 
removed prior to detection by matched filtering and the 
watermark is Inserted into the N largest spectral coeffi- 
cients, the ranking of wt\\ch is not preserved after water- 
marking. Thus, this method does not allow software and 
hardware devices to directly read embedded signals. 

In an article by Cox et al., entitled " Secured Spec- 
trum Watermarking for Multimedia" available at 
htlpy/www.neci.nj.conr>/trAind6x.html (Technical Report 
No. 95-10) spread spectrum watermarking is described 
which embeds a pseudo-random noise sequence into 
the digital data for watermarking purposes. 



The above prior art watermark extraction methodol- 
ogy requires the original image spectrum be subtracted 
from the watermark image spectrum. This restricts the 
use of the method when there is no original image or 
5 original image spectrum available. One application 
where this presents a significant difficulty is for third 
party device providers desiring to read embedded infor- 
mation for operation or denying operation of such a 
device. 

10 In U.S Patent No. 5,319,735 by R.D. Preuss et al 
entitled "Embedded Signalling" digital information is 
encoded to produce a sequence of code symbols. The 
sequence of code symbols is embedded in an audio sig- 
nal by generating a corresponding sequence of spread 

75 spectrum code signals representing the sequence of 
code symbols. The frequency components of the code 
signal being essentially confined to a preselected sign- 
aling band lying witiiin the bandwidth of the audio signal 
and successive segments of the code signal con-e- 

20 spends to successive code symbols in the sequence. 
The audio signal is continuously frequency analyzed 
over a frequency band encompassing ttie signalling 
band and the code signal is dynamically filtered as a 
function of the analysis to provide a modified code sig- 

25 nal with frequency component levels which are, at each 
time instant, essentially a preselected proportion of the 
levels of the audio signal frequency components in cor- 
responding frequency ranges. The modified code signal 
and the audio signal are conrt)ined to provide a conpos- 

30 tte audio signal in which the digital information is 
embedded. Tliis component audio signal is then 
recorded on a recording medium or is otherwise sub- 
jected to a transmission channel. Two key elements of 
tills process are tiie spectral shaping and spectral 

35 equalization tiiat occur at the insertion and extraction 
stages, respectively, thereby allowing the embedded 
signal to be extracted without access to the unwater- 
• marked original data. 

In EP-patent application No. 97 11 53 53.1 con-e- 

40 spending to U.S. Patent Application Serial No. 
08/708.331 . filed September 4, 1996. entitled "A Spread 
Spectrum Watermark for Embedded Signaling" by Cox. 
and incorporated herein by reference, tiiere is 
described a method for extracting a watermark of 

45 embedded data from watermarked images or video 
without using an original or unwatermarked version of 
the data. This work can be viewed as an extension of 
the original work of Preuss et al from the audio domain 
to images and vkieo. 

50 This method of watermarking an image or image 
data for embedding signaling requires that the DCT 
(discrete cosine ti-anslbrm) and its inverse of the entire 
image be computed. There are fast algorithms for com- 
puting the DCT in N log A/ time, where N is ttie number 

55 of pixels in tiie image. However, for A/=5l2x512. tiie 
computational requirement Is still high, particularly if tiie 
encoding and extracting processes must occur at video 
rates, i.e. 30 frames per second. This method requires 
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approximately 30 times the computation needed for 
MPEG-I I decompression. 

One possible way to achieve real-time video water- 
marking is to only watermark every N^*^ frame. However, 
content owners wish to protect each and every video 
frame. Moreover, if it is known which frames contain 
embedded signals, it is simple to remove those frames 
with no noticeable degradation in the video signal. 

In U.S. Patent Application Serial No. 08/715,953. 
filed September 19. 1996, entitled "Watermarking of 
Image Data Using MPEQ/SPEG Coefficients" by Cox, 
and incorporated herein by reference, there is 
described an alternative method, which is to insert the 
watermark into n x n blocks of the image (subimages) 
where n«N Then the computation cost is 

N 

- n log n s= A/ log n . 



For N^512x512^2^^ and n^8x8=2^, the asynp- 
totic saving is only a factor of 3. However, empirically the 
cost of computing the DCT over the entire image may 
be significantly higher when cache, loop unfolding and 
other efficiency issues are considered. Thus, the practi- 
cal difference may approach a 30 fold savings. More 
importantly, if the block size is chosen to be 8x8, i.e. the 
same size as that used for MPEG image compression, 
then it is possible to tightly couple the watermark inser- 
tion and extraction procedures to those of the MPEG 
compression and decompression algorithms. Consider- 
able computational saving can then be achieved since 
the most expenses computations relate to the calcula- 
tion of the DCT and its inverse and these steps are 
already conputed as part of the compression and 
decompression algorithm. The incremental cost of 
watermarking is then very small, typically less than 5% 
of the computational requirements associated with 
MPEG. 

The present invention improves the reliability of the 
invention described in the 08/715.953 application by 
storing watermark information into subimages. and 
extracting watermark information from subimages. in a 
manner different from that described earlier. 

The present invention improves the reliability of the 
prior systems by systematically varying the order in 
which watermark signal components are inserted into 
each subimage, by inserting only part of the watermark 
signal into each subimage, and. during watermark 
detection, by combining the watermark signals found in 
groups of subimages to reconstruct the original water- 
mark signal before testing for correlation with any pre- 
defined watermarks 

For detection, a reverse transformation is applied to 
each subimage to reconstruct the watermark informa- 
tion that was stored in that subimage. The resulting sig- 
nals are then averaged together to reconstruct the 
whole watermark, and to reduce noise. Finally, this 
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reconstructed watermark is compared against a prede- 
fined set of watermark signals to determine which one 
was inserted into the image. 

A principal object of the present invention is there- 

5 fore, the provision of inserting a subset of a watermark 
into a subset of subregions of data to be watermarked. 

Another object of the invention is the provision of a 
digital watermarking system in which a watermark is 
extracted by averaging the watermarked signal from 

10 subregions of watermarked data, and then correlating 
the resulting signal to determine the watermark. 

A further object of the invention is the provision of a 
digital watermarking system in which the v^termark is 
composed of two portions, a verification portion and a 

75 synchronization portion, in order to improve watermark 
extraction reliability 

Further and still other objects of the invention will 
become more clearly apparent when the following 
description s read in conjunction with the accompany- 

20 ing drawing. 

Brief Description of the Drawings 

Figure 1 is a schematic block diagram of watermark 

25 insertion procedure; 

Rgure 2 is a schematic block diagram of a water- 
mark insertion procedure in accordance with the 
teachings of the present invention; 
Figure 3 is a schematic block diagram of a water- 

30 mark extraction procedure; 

Figure 4 is a schematic block diagram of a water- 
mark extraction procedure in accordance with the 
teachings of the present Invention; 
Figure 5 is a graphic representation of a zigzag pat- 

35 tern useful for vectorizing subimages; 

Figure 6 is a graphic representation of rotation of 
PN sequences; 

Rgure 7 is a graphical representation of an 8x8 
block shown the spatial relation of averaged terms; 
40 Figure 8 is a schematic block diagram of a method 
for inserting watermarks in accordance with the 
present invention; and 

Figure 9 is a schematic block diagram of a rriethod 
for extracting watermarks In accordance with the 
45 present invention. 

Dgtaiigij Pescriptipn 

Referring now to the figures, and Figures 1 through 
50 4 in particular, there is shown schematic block diagrams 
of a general method for inserting and detecting water- 
marks in digital data, for instance images. 

In the following description, reference may be made 
to Image data or Images. While the invention has appli- 
55 cability to image data and images, it will be understood 
that the teachings herein and the invention itself are 
equally applicable to video, image and multimedia data 
and the term "image" and "image data" will be under- 
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stood to include these terms where applicable. As used 
herein, "watermark" will be understood to Include 
embedded data, symbols.-fmages, instructions or any 
other identifying information. 

In the following description, reference is made to s 
procedures described in U.S. Patent Application Serial 
No. 08/534,894 for inserting and extracting or detecting 
a watermark in images as INSERT-ORIGINAL and 
EXTRACT-ORIGINAL, respectively Reference Is made 
to procedures described in U.S. Patent Application io 
Serial No. 08/708,331 filed September 4, 1996 for 
inserting and extracting or detecting watermarks in 
images as INSERT-WHOLE and EXTRACT-WHOLE, 
respectively. And reference is made to procedures 
described in U.S. Patent Application Serial No. is 
08/715,953 for inserting and extracting or detecting 
watermarks in images as INSERT-MPEG-A and 
EXTRACT-MPEG-A. respectively 

Figure 1 shows a scherratic block diagram of 
INSERT-WHOLE procedure for inserting watermarks 20 
into images. The watennark signal, in ttie form of a finite 
sequence of symbols chosen from an alphabet, is pro- 
vided as an input to an error correction encoder 10 
which transforms this sequence into another sequence 
that contains redundant information. The output of 25 
encoder 1 0 Is provided to a PN-mappef 1 1 , which maps 
each symbol of the encoded watermark into a pre-spec- 
ified pseudo-random noise (PN) code. The output of the 
PN-mapper 11 Is provided to a spectral transformer 12, 
which converts the pseudo-random noise sequence into so 
the frequency domain. The conversion preferably is by 
discrete cosine transform (DCT), however, fast fourier 
transform, wavelet type decomposition and the like may 
also be used for frequency conversion. Concurrently, 
the data to be watermarked is provided to another spec- 3S 
tral ta'ansfbrmer 13. The outputs of the two spectral 
transformers 12 and 13 are tiien provided as inputs to a 
spectral shaper 14, which modifies tiie spectral proper- 
ties of the pseudo-random noise codes from spectral 
transformer 12 to mask the watermark when added to 40 
the image data The spectrally transformed data to be 
watermarked, from spectral transformer 13, is also pro- 
vided as an input to a delay 15. The output of the spec- 
tral shaper 14 is then added to the output of delay 15 at 
a summer 16. The summer output is subject to an 45 
inverse transform 1 7. The result of the inverse transform 
is watermarked data. 

INSERT-MPEG-A differs from INSERT-WHOLE by 
segmenting the data to be watermarked Into multiple 
blocks, such as 8x8 pixel subimages or subregions. so 
Each block of data then has tiie watermark inserted 
according to the above described method. That is, for 
each 8x8 subimage or subregion, a pseudo-random 
number (PN) sequence is inserted into the DCT coeffi- 
cients after suitable spectral shaping. The procedure is ss 
repeated for all such subimages or subregions. The size 
of tine subimage or subregion Is preferably 8x8, but It 
can be of other sizes, such as 2x2, 3x3, 4x4 or 16x16. 



Figure 2 shows a schematic block diagram of a 
watermark insertion procedure in accordance with 
teachings of the present invention. The watermark sig- 
nal is processed into a noise spectrum^ signal by tiie 
error correction encoder 20, the PN mapper 21. and the 
specti-al transformer 22, in the same manner as 
described in conjunction with Figure 1 . However, unlike 
INSERT-WHOLE or INSERT-MPEG-A, the watermark is 
then used as an Input to a watermark segmenter 23. 
which systematically separates the watermark into sev- 
eral subwatermarks. Any portion of the original water- 
mark might appear redundantly in several of the 
resulting subwatermarks. Concurrentiy, the data to be 
watermarked is used as an input to data segmenter 24. 
which segments the data into blocks or subregions. 
such as 8x8 subimages, as In INSERT-MPEG-A. Each 
of tiie subwatermarks output by the watermark seg- 
menter 23 is then inserted into a data block by one of 
the watermark inserters 25a. 25b, etc. The procedure 
used by tiie watermark inserters 25a, 25b, etc., is the 
same procedure described connection with watermark 
inserter 18 in Figure 1. That is. each subwatermark is 
added into a spectrally transformed data block after 
specti^l shaping, and the resulting data is then trans- 
formed back into the spatial domain. Rnally, the water- 
marked data blocks from the watermark inserters 25a, 
25b, etc., are assembled by data combiner 26 to pro- 
duce watermarked data 

Rgure 3 shows a schematic block diagram of the 
EXTRACT-WHOLE procedure. TTie watennarked 
image, video or multimedia data is first used as input 
into a spectral normalizerSO to undo any previously per- 
formed spectral shaping. If tiie data contains a water- 
mark, ttien tiie output of tiie spectral normalizer 30 will 
resemt>le the spectral transformation of the PN coding 
of that watermark (the signal that was input to tiie spec- 
tral shaper 14 in Rgure 1). The output of the spectral 
normalizer 30 is then used as an input to several corre- 
lators 31a, 31b, etc., which test the watermark with tiie 
PN codes us^ to represent tiie various symbols that 
tiie encoded watermark might contain (i.e. each conrela- 
tor tests for one PN code tiiat is used to encode a sym- 
bol by ttie PN maw^er 1 1 of Figure 1). The outputs of tiie 
correlators 31a, 31b, etc., are used as inputs to a deci- 
sion circuit 32. which determines the most likely 
sequence of symbols. Rnally. this sequence is cor- 
rected by an, error corrector 33, which performs tiie 
inverse coding that was performed by the enor conec- 
tion encoder 10 In Figure 1. The result Is the extracted 
watermark. 

In EXTRACT-MPEG-A. tiie data from which a 
watermark is to be extracted Is first segmented into sev- 
eral blocks, such as 8x8 subimages, exactly as in 
INSERT-MPEG-A. The signal from each subimage is 
tiien normalized and i©ed as input into a tank of corre- 
lators similar to the con-etators 31a. 31b. eto. in Rgure 
3. The output from tiie correlators is then averaged witii 
tiie outputs of corresponding correlators from other 
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subimages, and the resulting average correlations are 
used as inputs into the decision circuit 32 for subse- 
quent processing as described above. 

Figure 4 shows a schematic block diagram of a 
watermark extraction procedure In accordance with the 
teachings of the present invention. The watermarked 
data is first segmented into blocks by data segmenter 
40. which corresponds to the data segmenter 24 used 
during the insertion procedure in Figure 2. Each of the 
data blocks is provided to a respective spectrum nor- 
malizer 41a. 41b. etc. to produce a signal resembling 
the subwatermark that was inserted into the respective 
data block. These inserted subwatermark signals are 
then used as inputs into a watermark combiner 42. In 
the combiner 42, parts of the watermark that appear 
redundantly In several subwatermarks are averaged 
together to reduce noise. The output of the watermark 
combiner 42 is provided as the input to a symbol sepa- 
rator 43 which divides the watermark into parts, each of 
which conesponds to one symbol from the encoded 
watermark signal (the output of error correction encoder 
20 in Figure 2). 

These symbols from separator 43 are provided as 
inputs to respective watermark identifiers 44a. 44b etc. 
each of which includes of a bank of con-etators and a 
decision circuit, as shown in Figure 3. The outputs of the 
watermark identifiers are symbols from the alphabet 
used in the original encoded watermark signal. The 
identified symbols are reassembled into a complete 
encoded watermark by the symbol combiner 45. Finally, 
the resulting encoded watermark is decoded by the 
error corrector 46. . 

The insertion and extraction procedures will now be 
described in more detail. In INSERT-ORIGINAL and 
EXTRACT-ORIGINAL, the object is to embed a single 
PN (pseudo random number) sequence irrto an image 
when the original image is available at the time of 
extraction. The information associated with the PN 
sequence is assumed to be stored In a database 
together with the original image and the spectral loca- 
tion of the embedded watermark. The locations of the 
watermarked components has to be recorded because 
the implementation approximated the N perceptually 
most significant regions of the watermark by the N larg- 
est coefficients. However, this ranking was not invariant 
to the watermarking process. The N largest coefficients 
may be different after inserting the watermark than 
before Inserting the watermark. 

In order to avoid this problem, the present Invention 
places a watermark in predetermined locations of the 
spectrum, typically the first N coefficients. However, any 
predetermined locations could be used, though such 
locations should belong to the perceptually significant 
regions of the spectrum If the watermark is to survive 
common signals transformations such as conrpression, 
scaling, etc. 

More generally the information to be embedded is 
a sequence of m symbols drawn from an alphabet A 



(e.g. the binary digits or the ASCII symbols). This data 
is then supplemented with additional symbols for error 
detection and correction. Each symbol is then spread 
spectrum modulated, a process that maps each symbol 

5 into a unique PN sequence known as a chip. The 
number of bits per chip is preset - the longer the chip 
length, the higher the detected signal-to-noise ratio will 
be, but this is at the expense of signaling bandwidth. 
The power spectrum of the PN sequence is white, 

10 i.e. flat, and is therefore shaped to match that of the 
. "noise", i.e. the ImageAndeo/audio/or multimedia data 
into which the watermark is to be embedded. It is this 
spectral shaping that must be modified from the prior 
methods so that the extraction process no longer 

1$ requires the original image. To do this, each coefficient 
of the watermarked sp>ectrum is scaled by the local 
average of the power in the image spectral coefficient 
rather than the coefficient itself, i.e. 

20 f]-fi^aavg[\fj\)W, (1) 

The averaging is the averaging of the absolute coef- 
ficient values and not the coefficient values themselves. 
This is effectively estimating the average power present 

25 at each frequency Other averaging procedures are pos- 
sible, for example, averaging over several frames or 
average of local neighborhoods of 8x8 blocks. 

This average may be obtained in several ways. It 
may be a local average over a two dimensional region. 

30 Alternatively the two dimensional spectrum may be 
sampled to form a one dimensional vector and a one 
dimensional local average may be performed. One 
dimensional vectorization of the two dimensional 8x8 
DCT coeffidents is already performed as part of MPEG 

35 Il.The average may be a simple box or weighted aver- 
age over the neighborhood. 

For video data, temporal averaging of the spectral 
coefficients over several frames can also be applied. 
However, since several frames are needed for average 

40 ing at the spectral normalization stage of the extractor, 
the protection of individual video frames taken in isola- 
tion may not be possible. For this reason, the present 
Invention treats video as a very large collection of still 
images. In this way, even individual video frames are 

45 copy protected- 

In order to extract the waternrBrk, it is necessary to 
perform the spectral normalization, in which the previ- 
ously performed spectral shaping procedure Is inverted. 
In the present Invention, the original unwatermarked 

so signal is not available. Thus, the average power of the 
frequency coefficients, avg{\fj\),\B approximated by the 
average of the watermarked signal, I.e. avg{\ff\) 

as^g{\ff\) t= avg[\fj\) (2) 

55 

This is approximately true since aav/p(|f/|) VV-«/-. 
where Wj is the watermark component, and a is a con- 
stant typically in the range between 0.1 and 0.01 . 
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The normalization stage then divides each coeffi- 
cient {fi ) In the received signal by the local average avg 
(|f/D in the neighborhood. 

That is, 



The first term, on the right hand side (RHS) of io 
&1uation (3) , 



15 



is considered a noise term. This tenn was not present in 
the system desaibed U.S. Patent Application Serial No. 
08/534.894, because access to the unwatermarked 20 
coefficients allowed this term to be removed. The sec- 
ond term a Wj is the original watermark signal which can 
now be detected using conventional correlation. 

If the watermark is extracted from any single 8x8 
block, the detector reliability is very low. It however, the 25 
watermarks extracted from each 8x8 block are first 
added together and the averaged watermark is then 
applied to the correlator, then a very strong and unam- 
biguous response is obtained. This differs from the 
method described in U.S. Patent Application Serial No. so 
08/715,953 in which con-elation occun-ed within each 
block and the output from each correlator was averaged 
together. The present invention was found to improve 
the detection response and significantly reduced the 
computation requirement associated with each block. 3S 

In practicing the present invention preferably there 
is a unique PN sequence for each symbol in the alpha- 
bet. The method is relatively robust to clipping since the 
detector output reduces linearly with the quantity of 8x8 
subimage blocks in the image. For DVD (digital video 40 
disk) embedded signaling for APS (analog protection 
system) and CGMS (copy generation management sys- 
tem), there would be a total of 8 or 16 PN sequences. 

The number of 8x8 blocks in a 512x512 image is 
4096, suggesting that significantly more than one of 16 4S 
symbols can be embedded in an image or video frame. 
/\ssume. for example, that it is desired to embed 1 out of 
128 symbols in an image. It is necessary to perform 1 28 
parallel correlations. This is computationally tractable 
but hardware implementations of each con-elation so 
become more complex. An alternative method is to only 
use two binary symbols. It may be preferable to associ- 
ate more than one PN sequence with each of the two 
binary symbols or bits in order to increase the difficulty 
of intentionally removing the watermark, in this case, 55 
there are only two con-elators and a binary string may 
be embedded into the image. The raw bit error rate will 
be very high due by the low detector output However, 



this can be reduced to acceptable levels by using en-or 
correcting codes, such as Reed-Solomon (RS). RS 
codes are robust to burst error which may occur 
because of dipping of the image. Other error correcting 
codes may also be used. 

When using this method, it is necessary for the 
receiver to know the start location of the encoded block. 
The start location may not be obvious, particularly when 
the image has been subjected to clipping. However, 
convention synchronizing methods can be used; such 
as preceding each block with a special or unique sym> 
bol or string of symbols. 

. To insert a watermark, each 8x8 block is treated as 
an individual subimage or subregion. Tile DCT of the 
subimage is then computed and the two dimensional 
DCT is vectorized in the zigzag pattern shown in Rgure 
5, although other patterns are also possible. These two 
stages constitute most of the calculations but are part of 
the MPEG encoding process. Next, a PN noise 
sequence {W| ...Wnj is inserted into the DCT coefficients 
using Equation 1 as before. The length of the PN 
sequence cannot exceed 64 (in an 8 x 8 block) and is 
typically much shorter, in the range of 1 1 to 25. If only a 
single code is to be inserted into the image, then the 
same PN sequence is inserted into each of the 720 x 
480/64 - 5400 blocks. However, a variation may be per- 
formed at this point in the procedure. Within each row of 
blocks, the PN sequence is cyclically rotated by one fre- 
quency coefficient prior to insertion in the subsequent 
block. Similariy. the PN sequence is cyclically rotated by 
one frequency coefficient at the start of each new row. 
Figure 6 illustrates an order of rotations. 

The purpose of these rotations or shifts is to 
improve the response of the watermark extraction 
stage. Earlier experiments revealed that certain DCT 
coefficients were more difficult to estimate than others. 
The location at these coefficients varied from image to 
image. However, within an Image, the coefficient could 
be consistentiy poor. Consequently, without shifting, 
one or more of the estimated watermark coefficients 
could be significantly degraded relative to tiie other 
watermark coefficients, thereby reducing tfie detector 
performance. Conversely, shifting significantiy reduces 
the effect a poor DCT coefficients has on a single water- 
mark coefficient and tiie detector performarice is mark- 
edly improved. Note that any cyclic pattern can be used. 

Further modifications are useful once rotation of the 
watermark has been introduced. First, the length of the 
watermark may now be significantiy greater than 64. 
Then, for each block only a small subset of tiie water- 
mark (say five) coefficients is inserted into the first five 
DCT coefficients (excluding tiie d.c. term). Because of 
the rotation, a different subset of tiie watermark is 
inserted into neighboring 8x8 blocks. Finally, having 
completed the watermark insertion, the MPEG encoder 
Is able to proceed witii the sutssequent stages of com- 
pression. 

Note that the watermark may also be inserted after 
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the MPEG quantization stage to reduce distortion of the 
watermark. MPEG-2 performs a convenient one dimen- 
sion vectorization called "zigzagging", which allows a 
simple 3x1 box average to be performed on the coeffi- 
cients (excluding the d.c. term). 

In practice, performance was improved if the aver- 
aging is performed using the 2 four-connected coeffi- 
cients closest to the d.c. term, as illustrated in Figure 7, 
i.e. the two coefficients above and to the left. 

Watermark detection begins by first extracting the 
FN noise sequence from each 8x8 block using Equation 
1 . For each block, the PN sequence is then cyclically 
shifted in the opposite direction by one frequency coef- 
ficients, and the average over all the blocks is then com- 
puted. In practice, this process can be computed 
incrementally and does not require temporary storage 
of all the extracted watermarks. A weighted averaging 
can also be applied, where the weights are determined 
based on their susceptibility to common signal transfor- 
mations such as low pass filtering. Rnally, the average 
watermark is compared with the original PN sequence 
via correlation. The reason for shifting the watermark in 
the column direction may now be apparent. If the image 
is clipped on an arbitrary block boundary, then the com- 
puted average watermark will sinrply be rotated by an 
amount that is a function of the relative location of the 
clipped portion of the image. Correlation can then be 
performed on all permutations (typically 1 1 to 25) of the 
watermark. The output from the correlator with the max- 
imum value is then used for decision purposes. The 
extraction stage is depicted in Figure 4. Taking the max- 
imum correlator output over all rotations of the water- 
mark can cause the decision circuitry to be noisy. To 
improve this, the watermark is broken into two pieces; a 
synchronisation portion is of length K and a verification 
portion is N-K. Then, when the watennark is extracted 
as before, correlation is first performed only on all rota- 
tions of the synchronization portion of this watermark. 
The maximum correlation output is noted, then tile veri- 
fication portion of the watermark is rotated by the corre- 
sponding amount and a second correlation is performed 
on the verification portions of the watermarks. This 
. process significantly improves the overall reliability of 
the system. In the course of experimentation, it was 
noticed tiiat some watermarks performed better than 
others on the same imagery. This was caused by varia- 
tion in the correlation statistics between the synchroni- 
zation and verification portions of tiie watermark. 
Ideally, the two portions should have very fow correla- 
tions. However, in several cases where watermarks per- 
formed poorly, it was traced to unexpected correlations 
between the two portions. 

The present invention provides a modification to 
digital watermarking methods in which the original data 
is required for watermark extraction thereby enabling 
watermarking extraction in the absence of an unwater- 
marked or original data. The present invention prefera- 
bly uses MPEG/JPEG coefficients. An image is divided 



into typically 8x8 block subimages or subregions and 
each subimage is processed and the results are com- 
bined to derive tiie extracted watermark. The result is 
extraction of the watermark with very high confidence. 

5 While the above invention describes improvements 
to the prior-art INSERT-WHOLE. INSERT-MPEG-A. 
EXTRACT WHOLE, and EXTRACT-MPEG- A algo- 
rithms, it should be apparent to anyone skilled in the art 
that the same improvements may be applied to any 
.10 algorithm for inserting and extracting watermarks in 
image data. This more general view of the present 
invention is shown in Figures 8 and 9. 

Figure 8 shows a schematic block diagram of the 
general method for inserting watermarks. This general 

15 method makes use of a non-block-based watermark 
insertion algoritiim. which shall be referred to hereafter 
as the "base insertion algorithm". The watermark 
encoder 80 converts the watermark into a form appro- 
priate for the base insertion algorithm. If the base inser- 

20 tion algorithm is that shown in Figure 1 . for example, 
tiien tiie watermark encoder 80 con-esponds to the 
watermark encoder 19, which comprises the error cor- 
rection encoder 10. the PN mapper 1 1 , and the spectral 
transformer 12. However, if a different base insertion 

25 algoritiim is to be used, then the watermark encoder 80 
may perform a different ti-ansformation of the water- 
mark. The encoded watermark signal from watermark 
encoder 80 is provided as an input to watermark seg- 
menter 81. which divides the watermark into a set of 

30 subwatemnarks. Any portion of the original watermark 
might appear redundantly in several of the resulting 
subwatermarks. The data to be watermarked is pro- 
vided as an input to data segmenter 82. which divides 
the data into subregions. Each subwatermark is 

35 inserted into a respective data subregion by a water- 
mark inserter 83a, 83b, etc. The watermark inserters 
implement the base insertion algorithm, so, if the base 
insertion algorithm is that shown in Figure 1 , then each 
watermark Inserter 83a. 83b, etc.. corresponds to the 

40 watermark inserter 18, which comprises a spectral 
transformer 13. a spectral shaper 14. a delay 15. a sum- 
mer 15. and an inverse transform 17. However, if a dif- 
ferent base insertion algoritiim is to be used, then the 
watermark inserters 83a. 83b, etc., may employ a differ- 

45 ent method of inserting subwatermarks into the subre- 
gions of the data to be watermarked. The outputs from 
the watermark inserters are assembled in data com- 
biner 84 to provide watermarked data. 

Rgure 9 shows a schematic t>lock diagram of tiie 

50 corresponding general extraction algorithm. The algo- 
rithm makes use of a t>a'se extraction" algorithm that 
corresponds to the base insertion algorithm used in 
inserting tiie watermark into tiie data to be watermarked 
(Figure 8). The algorithm in Figure 9 is substantially the 

55 same as the algoritiim shown in Figure 4, except that in 
the general case, the spectrum normalizers 41 a, etc. 
are replaced by watermark extractors 91a. etc., which 
implement the base exto-action algorithm. That is, if the 
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base Insertion algorithm used was the algorithm shown 
in Figure 1, then the waterniark extractors 91a. etc., in 
Figure 9 will be the spectrum nomializers 41a etc. in 
Rgure 4. 

Whi}e there has been described and illustrated a s 
system for inserting a watermark Into and extracting a 
watermark from watermarked data without using an 
unwatermarked version of the data, it will be apparent to 
those skilled In the art that variations and modifications 
are possible without deviating from the broad principles io 
and teachings of the present invention which shall be 
limited solely by the scope of the claims appended 
hereta 

Claims is 

1. A method for inserting a watermark signal into data 
to be watermarked comprising the steps of: 

dividing data to be watermarked into a plurality 20 
of subregions; 

computing frequency coefficients of the data to 
be watermarked in each subregion; 
spread spectrum modulating a watermark sig- 
nal to be inserted t>y mapping the watermark 25 
signal Into a PN (pseudo-random noise) 
sequence; 

spectral shaping the PN sequence as a func- 
tion of the average power in each frequency 
coefficient of the data; and so 
inserting each spectral shaped PN sequence 
into predetermined coefficients in the data In 
each subregion. 

2. A method for inserting a watermark signal into data 3S 
to be watermarked as set forth in daim 1. where 
said frequency coefficients are DCT (discrete 
cosine transform) coefficients. 

3. A method for inserting a watermark signal into data 40 
to be watermarked as set forth in claim 1 or 2, 
where each subregion is a 8 x 8 block of pixels. 

4. A method for inserting a watermark signal into data 

to be watermarked as set forth in claim 3,where the 4S 
frequency coefficients of the watermark signal are 
rotated prior to inserting of each spectral shaped 
PN sequence into the subregion. 

& A method for inserting a watermark signal into data so 
to be watermarked as set forth in claim 4, where 
only a subset of the watermark signal frequency 
coefficients is inserted into any one subregion. 

6. A method for inserting a watermark signal Into data ss 
to be watermarked as set forth in claim 5 , where 
the watermark signal comprises a synchronization 
portion and a verification portion. 



7. A method for inserting a watermark signal into data 
to be watermarked as set forth in any of claims 1 to 
S, where said inserting is performed after the data 
undergoes MPEG quantization processing. 

8. A metiiod for inserting a watermark signal into data 
to be watermarked as set forth in claims 6 or 7, 
where tiie synchronisation portion and tiie verifica- 
tion portion have vecy iitUe con-elation t}etween 
each other. 

9. A mettiod for inserting a watermark signal Into data 
to be watermarked as set fortii in any of claims 1 to 
8. 

where the spectral shaping as a function of tiie 
average power is typically 3 x 1 window of the 
coefficient obtained from the one-dimensional 
vectorization by zigzagging of two<timension 
frequency coefficients. 

1 0. A method for inserting a watermark signal Into data 
to be watermarked as set forth in any of dainre 1 to 
8, where the ^ectral shaping is a function of the 
average power based on the two four-connected 
frequency coefficients closest to the DC term. 

11. A method of extracting a watermark from water- 
marked data comprising the steps of: 

receiving subregions of watermarked data; 
specti'um normalizing the watermarked data 
from each subregion to generate respective 
normalized signals; 

combining the respective normalized signals 
from each subregion to generate a single 
watermark; 

correlating the single watermark with predeter- 
mined PN (pseudo-random noise) sequences 
corresponding to predetermined symbols to 
provide coaeiated signals for each predeter- 
mined PN sequence in each subregion: 
deciding which correlated signal is most likely a 
current symbol; and extracting a sequence of 
most likely cun-ent symbols con-esponding to 
the watermark, 

12. A method of extiacting a watermark from water- 
marked data as set forth In claim 1 1 , where the sub- 
regions are 8 X 8 blocks used for MPEG encoding 
and decoding. 

13. A method of extracting a watermark from water- 
marked data as set forth in claim 12. where said 
combining the normalized signals from each subre- 
gion to generate a single watermark, including 
removing the relative rotation of tiie watermark 
between blocks. 
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14. A method of extracting a watermark Irom water- 
marked data as set lorth in claim 13, further com- 
prising subsequently reconstructing the waternnark 
from partial watermarks inserted into each block. 

15. A method of extracting a watermark from water- 
marked data as set forth in claim 14. further com- 
prising weighting the watermark coefficients tased 
on their location within the frequency spectrum, 
where the weighting is a function of the susceptibil- 
ity of each frequency coefficient to common signal 
transformations. 

16. A method of extracting a watermark from water- 
marked data as set forth in claim 11. further com- 
prising correlating with all rotational shifts of the 
extracted watermark and selecting the maximum 
value. 

17. A method of extracting a watermark from water- 
marked data as set fortii in claim 11, further com- 
prising correlating with all rotational shifts of a 
synchronization portion of a watermark to deter- 
mine a maximum value and subsequently rotating a 
verification portion of the watermark by the same 
amount as the synchronization portion is rotated to 
obtain the maximum value prior to correlating 
between tine verification portion and predetermined 
PN sequences. 

18. A method of extracting a watermark from water- 
marked data comprising the steps of: 

receiving subregions of watermarked data; 
spectrum normalizing the waternrarked data 
from each subregion to generate respective 
normalized signals; 

correlating the respective normalized signals 
with predetermined PN sequences corre- 
sponding to predetermined symbols to provide 
conrelated signals for each predetermined PN 
sequence in each subregion; 
deciding which con-elated signal is most likely a 
current symbol in each subregion for providing 
an exti'acted symbol stream; 
enror con-ecting the extracted symbol stream; 
and 

exti^acting a sequence of most likely current 
symbols con-esponding to tiie watermark. 

19. A method of extracting a watermark from water- 
marked data as set forth in claim 18, where said 
en-or correction is Reed Solomon en-or correction. 

20. A method for inserting a watermark signal into data 
to be watermarked comprising the steps of: 

dividing data to be watermarked into a plurality 



of subregions; 

dividing a watermark signal into a plurality of 
subwatermarks where portions of the water- 
mark are contained in more than one subwater- 
5 mark; and 

inserting said plurality of subwatermarks into 
said plurality of subregions. 

21 . A method for inserting a watermark signal into data 
10 to be watermarked as set forth in claim 20, where 
each subwatermark is inserted into a respective 
subregion. so tiiat each subregion contains at least 
one subwatermark. 

75 22. A metiiod for extracting a watermark signal from 
watermarked data comprising tiie steps of: 

receiving a plurality of subregions of watermark 
data; 

20 extracting a subwatermark from each subre- 

gion of said plurality of subregions; and 
combining and averaging the subwatermarks 
extracted from all the subregions to obtain a 
signal commensurate with the watermark sig- 

25 nal. 

23. A method for extracting a watermark signal from 
watermarked data as set fortii in claim 22. further 
comprising tiie steps of: 

30 

dividing tine signal commensurate witin tiie 
watermark signal into a plurality of symbol sig- 
nals; 

correlating each symbol signal with a set of 

35 predefined signals; 

determining which predefined signal best cor- 
responds to each symbol signal; and 
combining the best corresponding predeter- 
mined signals to generate the watermark sig- 

40 nal. 
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